WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
G06F 9/46 



Al 



(11) International Publication Number: WO 98/18076 

(43) International Publication Date: 30 April 1998 (30.04.98) 



(21) International Application Number: PCT/US97/ 19468 

(22) International Filing Date: 20 October 1997 (20.10.97) 



(30) Priority Data: 
08/733,516 



18 October 1996(18.10.96) 



US 



(71) Applicant: INTER VU, INC. [US/US]; 201 Lomas Santa Fe 
Drive, Solana Beach, CA 92075 (US). 

(72) Inventors: KENNER, Brian; 1403 Walnut Creek Drive, 
Encinitas, CA 92024 (US). KARUSH, Arnold; 3253-25 
Caminito Eastbluff, La Jolla, CA 92037 (US). 

(74) Agents: W1XON, Clarke, A. et al.; Darby & Darby, P.C., 
32nd floor, 707 Wilshire Boulevard, Los Angeles, CA 90017 
(US). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA. CH, CN, CU. CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO. NZ, 
PL, PT. RO, RU, SD, SE, SG, SI, SK, SL, TJ. TM f TR, 
TT, UA, UG, UZ, VN, YU, ZW, ARIPO patent (GH, KE, 
LS, MW, SD, SZ, UG, ZW), Eurasian patent (AM, AZ, BY, 
KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, CH, 
DE, DK, ES, FI, FR, GB, GR, IE, IT. LU. MC, NL. PT, 
SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, ML, 
MR, NE, SN, TO, TG). 



Published 

With international search report 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: SYSTEM AND METHOD FOR OPTIMIZED STORAGE AND RETRIEVAL OF DATA ON A DISTRIBUTED COMPUTER 
NETWORK 

(57) Abstract 

A system and method for the optimized storage and retrieval 
of video data at distributed sites call for the deployment of "Smart 
Mirror" sites (26, 28, 30) throughout a network (10), each of which 
maintains a copy of certain data managed by the system. Every user ' 
(12, 16, 20) is assigned to a specific delivery site based on an analysts 
of network performance with respect to each of the available delivery 
sites. Generalized network performance data is collected and stored 
to facilitate the selection of additional delivery sites and to ensure 
the preservation of improved performance in comparison to traditional 
networks. 




Codes used to identify 



AL Albania 

AM Armenia 

AT Austria 

AU Australia 

AZ Azerbaijan 

BA Bosnia and Herzegovina 

BB Barbados 

BE Belgium 

BF Burkina Faso 

BG Bulgaria 

BJ - Benin 

BR Brazil 

BY Belarus 

GA Canada 

Central African Republic 
CG Congo 
Cll Switzerland 
CI C6te d'lvoire 
CM Cameroon 
CN China 
CU Cuba 
CL Czech Republic 
OK Germany 
DK Denmark 
EE Estonia * 



FOR THE PURPOSES OF INFORMATION ONLY 



ES Spain 

I?l Finland . 

PR France 

GA Gabon 

GB United Kingdom 

GE Georgia 

GH Ghana 

GN Guinea 

GR Greece 

HU Hungary 

IE Ireland 

1L Israel 

IS Iceland 

IT UaW 

jp Japan 

KE Kenya 

KG Kyrgyzstan 

KP Democratic people's 

Republic of Korea 

KR Republic of Korea 

KZ Kazakstan 

IX Saint Lucia 

U Liechtenstein 

LK Sri Lanka 

LR Liberia 



LS Lesotho 

LT Lithuania 

IX) Luxembourg 

LV Latvia 

MC Monaco 

MD Republic of Moldova 

MG Madagascar 

MK The former Yugoslav 
Republic of Macedonia 



ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SB 


Sweden 


SG 


Singapore 



SI Slovenia 

SK Slovakia 

SN Senegal 

SZ Swaziland 

TD Chad 

TG Togo 

TJ Tajikistan 

TM Turkmenistan . 

TR Turkey 

TT Trinidad and Tobago 
UA Ukraine 
UG Uganda 
US United States of America 
UZ Uzbekistan 
VN Viet Nam 
YU Yugoslavia 
ZW Zimbabwe 



WO 98/18076 



PCT7US97/19468 



SYSTEM AND METHOD FOR OPTIMIZED STORAGE 
AND RETRIEVAL OF DATA ON A 
DISTRIBUTED COMPUTER NETWORK 



10 



The invention relates to a system and method for 
distributed data storage and retrieval, and more 
particularly, to a system and method whereby a user can 
acquire network performance information for a dynamic and 
15 distributed multipurpose network, and use this information 
to identify and select optimum delivery sites or servers 
from which to receive computer data, specifically multimedia 
content. Such delivery sites and servers are selected so as 
to increase network capacity, distribute server load, and 
20 reduce transmission delays between the server and the user. 



BACKGROUND OF THE INVENTION 

The Internet is a loose network of connected computers 

25 spread throughout the world. A message can be sent from any 
computer on the Internet to any other by specifying a 
destination address and passing the message from computer to 
computer via a series of "hops." Each computer , router , or 
"node" on the Internet has a unique Internet address. When 

30 an intermediate computer or router receives a message in 

transit, the computer checks the intended destination of the 
message and passes it along accordingly . 

The Internet is growing, in terms of both size and 
sophistication, at a rapid rate. In the past, most users of 

35 the Internet were academic, research, or institutional 
users; the Internet was primarily used at that time to 
transmit and receive electronic mail and network news and to 
allow transfer of computer files. However, since the 
introduction of the World Wide Web (also known as the "Web*' 

40 or the "WWW") several years ago, the Internet has begun to 
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host increasing counts of other types of 

interest, namely representations of images, articles etc. 

The web protocol and language establish a graphical 
means to navigate the expanses of the Internet. "Web 
X*s,» often consisting primarily of text and ^ph.cal 
material, are stored on numerous colters, known as "Web 
severs .» throughout the Internet. A software program Known 
L a -browser- can be used to access and view Web pages 
across the Internet by specifying the location (i.e. 
internet address) of the desired Web page. When a Web page 
is accessed, its information is transmitted fro* the remote 
computer (server or delivery site, . wherever in the world it 
„y be located, across the Internet, to the user. _ 

in recent times, the Web has begun to host highly 
sophisticated types of multimedia content, such as audio and 
video data, and computer software. Compared to first 
deration Web content, namely text and still images, audio 
clips, video clips, and software programs have extremely 
high storage and bandwidth requirements. 
, At present, it is difficult, if not impossible, to 

provide sustained high-speed transmission of large 
ludio/video files over a multi-node link an the Internet. 
Because the data is often transferred from afar many 
factors can cause the delay or even loss of parts or all of 
, 5 a transmission. It is generally not critical if a user 

experiences minor delays in receiving small graphic or text 
Til However, it is recognized that real-time data such 
as video has very specific and stringent timing requirements 

for data transfer and display. 
3„ Unfortunately, the present design of traditional 

Internet-like data networks is based on the principle that 
delays and significant data transmission rate variations are 
acceptable for ordinary data (e.g. text and still images, 
consequently, because of the high value of permitting access 
35 ZtZ and graphical information from locations around the 
world, such transmission defects are considered acceptable, 
and the base capacity of the Internet is somewhat 
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"oversubscribed" to reduce data transmission costs. In 
other words, the timeliness of network data transmission has 
been significantly compromised in order to render relatively- 
insignificant the aggregate cost of long distance 

5 communication connections. 

In order to successfully transfer audio-video data 
across a message-oriented network such as the Internet, for 
any more than a few users, network resources should be 
committed in a manner facilitating timeliness of 

10 transmittal. A system using committed network resources 
generally cannot take advantage of the existing pricing 
scheme of shared networks like the Internet, since it cannot 
participate in the sharing of network resources on a data 
packet by data packet basis. Video data must be transmitted 

15 to the exclusion of lower-priority data. Transmission costs 
thus become significant, especially when the connection is 
"long distance" or when the connection is continued over an 
extended period of time. 

Another consequence of the timeliness vs. cost 

20 compromise discussed above has been the seemingly 

indiscriminate topographical design of the network. Since 
delays and throughput variations have traditionally been 
excused in favor of low cost, the configuration of the 
Internet inf rastructure has also been driven by cost 

25 considerations. Accordingly, the interconnection efficiency 
of the network has rarely been considered. The rapid growth 
of real time data is changing this requirement. 

It is recognized that inadequate data transfer 
performance of time-sensitive data on the Internet is 

30 typically caused by four factors: packet loss, excessive 
server utilization, the relatively low capacity of the 
network infrastructure , and inherent delays in the network 
hardware. Packet loss, in particular, is caused by 
inadequate infrastructure and lack of robustness in routing. 

35 The inherent delays are believed to be caused by, among 
other things, the lack of flow control between adjacent 
nodes in a multiple-node path on the Internet. 
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Unlike smaller text and graphic files, relatively large 
video files can take several minutes (or more) of 
••streaming," or constant data flow. Consequently, the usual 
network performance problems are exacerbated. Network 
5 bandwidth, or the data-carrying capacity of a particular 

network, is limited. Thus, packet loss and delays increase. 
Long delivery times consume a large amount of server 
capacity for a long time, decreasing the resources available 
to other users. Accordingly, because the network 
10 infrastructure becomes increasingly congested , packet loss 
and delays continue to increase, transmission times rise, 
and server load increases further . 

This pattern exemplifies a "downward spiral" of network 
performance, which can be driven by the attempted 
15 transmission of large data files such as video clips. As 

long as network traffic remains within the limits imposed by 
network bandwidth, network performance will remain 
acceptable. However, whenever peak network loads exceed 
capacity, the downward spiral described above will begin, 
20 causing increasing periods of poor network performance . 

As discussed above, a browser program can be used to 
access and view Web pages across the Internet by specifying 
the location (i.e. Internet address) of the desired Web 
page, or more commonly, by "hotlinking" to Web pages. 
25 Common browsers are Lynx, NCSA Mosaic, Netscape Navigator, 
and Microsoft Internet Explorer. The desired Web page is 
specified by a uniform resource locator ("URL") , indicating 
the precise location of the file using the syntax 
"http : / / internet . address/directory/ filename . html" . 
30 Web pages are generally described, in terms of layout 

and content, by way of a language known as "HTML" (HyperText 
Markup Language) . Any particular computer linked to the 
Internet can store one or more Web pages, i.e. computer 
files in HTML format, for access by users. 
35 Hotlinking from one HTML Web page to another is 

accomplished as follows. The user first accesses a Web page 
having a known address, often on the computer located at the 
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user's ISP (Internet Service Provider). The ISP is the 
organization providing Internet connectivity to the user. 
That Web page can contain, in addition to textual and visual 
data specified in HTML format, "links," or embedded 

5 information (in the form of URLs) pointing to the Internet 
addresses of other Web pages, often on other computers 
throughout the Internet. The user, by selecting a link 
(often by pointing and clicking with a mouse) , can then 
access other Web pages, which can in turn contain further 

10 data and /or additional links. 

Various extensions to HTML, such as Netscape's EMBED 
tag, allow references to other data to be embedded into Web 
pages. Some browsers are not capable of handling data other 
than text and images. Other browsers can handle the data in 

15 various ways. NCSA Mosaic, for example, handles references 
to unknown types of data by allowing the data to be 
downloaded to the user's computer, and then optionally 
invoking an external program to view or manipulate the data. 
Recent releases of Netscape Navigator and Microsoft Internet 

20 Explorer take the concept one step further: a browser 

extension, or "plug- in," can be automatically invoked to 
handle the data as it is received from the remote Web page. 
Other means, such as network program "applets" written in 
the Java language (or a similar language), can be used to 

25 extend the functionality of the browser environment or 
network. 

Digital multimedia data can have extremely high storage 
and bandwidth requirements. In particular, video files can 
be very large, from approximately 10 megabytes to 10 
30 gigabytes. In order to play video files at speeds 

. approaching their recorded rate at a user's terminal, the 
files have to be delivered at a fast, constant speed. Too 
slow, and the image plays back slower than originally 
recorded. If the speed is uneven, then the video appears 
35 jerky, like an old-time movie. 

The network design compromises discussed above 
generally adversely impact the transmission of audio and 
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video data across the Internet. While a user using a 
browser to "surf" the Web might not notice minor delays and 
transmission rate variations while retrieving text and still 
images, such defects become apparent and significant when 
real-time audio and video information is accessed. 

In an attempt to solve these problems, Internet content 
providers sometimes spread popular content around the 
Internet on various servers or delivery sites known as 
••mirror sites." Each mirror site contains information that 
is essentially identical to that of the original site. For 
example, if a popular Web site is located in New. York, 
mirror sites might be located in Los Angeles, London, and 
Tokyo. Accordingly, if a European user is having difficulty 
accessing the original New York site, he can hotlink to the 
mirror site that is geographically closest, i.e. London. 

However, mirror sites have several disadvantages. For 
example, mirror sites may be widely distributed 
geographically, but may not be efficiently distributed on 
the network in terms of actual usage , network traffic, etc. 
Thus, New York and Los Angeles mirror sites might both be 
connected to the same national Internet service provider's 
network, meaning that difficulty in accessing one of the 
sites might also affect the other. 

Furthermore, mirror sites might not be optimally placed 
to reduce load on each server . Although an "educated guess" 
might be made as to where a mirror site should be located, 
actual usage patterns might differ. Furthermore, there is 
no guarantee of enhanced performance. The bandwidth of the 
mirror site might be lower than that of the original site, 
30 or it might be overloaded for other reasons . 

Moreover, mirror sites are often hosted on a voluntary 
basis. If a Web site is extremely popular, and a service 
provider determines that the subject matter might be of 
interest to its subscribers, that service provider might 
agree to host a mirror site of the original Web site. Such 
an arrangement would be attractive to host of the mirror 
site because people would be drawn to the mirror site, and 
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might hotlink to other content hosted there. On the other 
hand, such voluntary alliances typically are not reliable 
and might be severed at any time. 

In essence, a mirror site offers a secondary source for 

5 data, which may or may not be available, and which may 

improve user convenience, but which does not address network 
bandwidth or efficiency. A mirror site does not account for 
performance characteristics of the network, nor identify 
available bandwidth which could be used to efficiently 

10 transmit video data while still taking advantage of the 
existing low-cost pricing schemes such as those on the 
Internet. 

Currently, there is no guidance in selecting optimal 
locations for delivery sites, nor is there a known method 

15 permitting a user to determine which mirror site to connect 
to that will ensure optimum performance. In fact, the use 
of a traditional mirror site is voluntary. Typically, a 
user will try to access the original site (or a known mirror 
site) , and will switch to another mirror site only if 

20 performance is found to be insufficient after one or more 
attempts. This approach is an inefficient utilization of 
network resources. Clearly, mirror sites are not an optimum 
solution to the problem of overloaded Web sites. A 
principal reason for this, among others, is the failure to 

25 consider network performance. 

Network analysis, particularly the performance of 
specific paths and links over the Internet, is well known 
and developed. For example, the "ping" program allows a 
computer connected to the Internet to determine whether a 

30 remote host is accessible. However, the ping program uses a 
low-priority network protocol known as the ICMP protocol, 
and accordingly does not provide meaningful performance 
analysis information. The "tracer out e M program follows the 
transmission of a message from a computer to a remote host, 

35 tracking delays along each link, and determining the path 
taken by the message. The traceroute application can be 
used to map the flow of data. However, it lacks the ability 



WO 98/18076 



PCTAJS97/19468 



8 

to provide meaningful performance analysis information. 
Traceroute only provides route information for a message 
propagating in one direction, and only for one instant in 
time. 

5 Moreover, only the connectivity characteristics of 

paths leading to and from the single computer running the 
tests are typically determined; expanding the scope of 
testing is possible but logistically impracticable, since 
the Internet is so large. 
10 Traditional network analysis techniques such as the 

"ping" and "traceroute" programs offer a view of network 
connectivity but provide little understanding of what 
performance can be expected from providers and mirror sites 
across the Internet. Therefore, only "guesses" can be made 
15 as to where delivery or mirror sites should be located or 
which mirror sites should be used to optimize performance. 

Accordingly, a need exists for a method of determining 
overall network performance. A further need exists for a 
system applying that method to enable content providers to 
20 dynamically locate data delivery or mirror sites at optimum 
network locations, and to allow users to select optimum 
mirror sites from which to receive data. 

SUMMARY OF THE INVENTION 

25 The invention is directed to a system and method for 

the optimized distribution of Web content to sites located 
around the Internet. An intelligent mirroring scheme, 
called here "Smart Mirroring," is used to determine the need 
for and distribution of mirror sites and to direct user 

30 requests for certain Web content to an optimum mirror site. 

A number of "smart" delivery or mirror sites are used 
to distribute popular Web content to various parts of the 
Internet. A comprehensive scheme of network analysis, based 
on tests performed by a large number of users, is used to 

35 interactively determine the preferred locations for the 

sites, and to determine the optimum sites to be used by each 
individual user. 
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Accordingly, because each individual user is routed to 
a Smart Mirror or delivery site that provides improved 
performance, overall network congestion is reduced. In most 
cases, the improved server is located electronically close 
5 to a user in order to decrease the number of network 

connections over which data must travel, thereby reducing 
packet loss and delay. 

Furthermore, network analysis results allow message 
traffic to be routed away from those delivery sites and 

10 network regions that are already overloaded, and toward 

underutilized servers and networks. This results in an 2 
improvement in throughput as seen by each user, and will 
thereby increase the appeal of the content offered by 
content providers using the system. Content providers are 

15 able to reach a larger number of users across the Internet 
without suffering significant decreases in performance. 

A system according to the invention begins with an 
original Web site and at least one additional delivery (or 
mirror) site. Each user desiring to use the system will be 

20 provided, in a preferred embodiment, with software which 

includes a configuration utility and a client program. The 
configuration utility is used first to determine which 
delivery sites provide improved performance for that 
particular user. 

25 In one embodiment of the invention, the conf iguration 

utility first downloads a "delivery site file" from a 
service provider. This delivery site file contains a list 
of available delivery sites and a list of network tests to 
be run. The types of tests and frequency of testing to be 

30 performed may be specified in the delivery site file, as 

dependent on the number of users testing the network and the 
estimated drain on network or delivery system capacity. 

The configuration utility will run a subset of the 
tests specified in the delivery site file. The test results 

35 show which delivery sites yield improved performance for the 
user, and also contain information on various generalized 
network capabilities from the standpoint of the user running 
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the tests. The network test results and the identity of the 
chosen delivery site will be sent (via e-mail in one 
possible configuration) back to the delivery service 
provider for incorporation into the service provider's 
5 database. 

The delivery site chosen by the configuration utility 
is then used by that user for the retrieval of all content 
managed by the delivery system service provider. 
Consequently, when the user is browsing Web content, and 

10 finds a particular item, e.g. a video clip, that is managed 
by the service provider^ delivery system, the client 
software will automatically retrieve it from the specified 
"Smart Mirror" delivery site. Site preferences and default 
sites can be updated periodically on request, at specified 

15 times, or in response to changes in network load and 
traffic. 

Moreover, because the configuration utility of the 
invention is performing various network tests and providing 
the test results to the service provider, valuable data on 
20 system and network performance is available. Such data 

provides information on which "Smart Mirror" delivery sites 
are performing effectively and which are not, which Smart 
Mirror delivery sites are overloaded, and what portions of 
the Internet might benefit from the addition of more 
25 delivery sites or capacity. 

Such data also makes it possible to perform such 
sophisticated network analysis as end-to-end performance 
measurements, workload characterization, route stability, 
and outage metrics. Thus, from an engineering standpoint, 
30 the mirror service provider can continue to ensure that 

improved performance is being provided. From a marketing 
perspective, content providers can be told where to locate 
Smart Mirror or delivery sites for improved performance, and 
what ISP provides improved delivery. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a block diagram of an illustrative network 
topology of a system according to the invention, including 
multiple users and multiple content providers; 
5 FIGURE 2 is a flowchart describing the operation of the 

configuration utility used in a system according to the 
invention; and 

FIGURE 3 is a flowchart describing the operation of a 
client program used in a system according to the invention. 
10 . 

DETAILED DESCRIPTION OF THE INVENTION 

The invention is described below, with reference to 
detailed illustrative embodiments. It will be apparent that 
the invention can be embodied in a wide variety of forms, 
15 some of which may be quite different from those of the 
disclosed embodiments. Consequently, the specific 
structural and functional details disclosed herein are 
merely representative and do not limit the scope of the 
invention. 

20 Referring initially to FIG. 1, the Internet 10, which 

is intended to be representative of wide-area communications 
networks in general, is depicted as a "cloud." The Internet 
is known to be an interconnected network of a large number 
of computers. Although Internet-connected computers that 

25 are "geographically" near each other can be "electronically" 
near each other on the Internet, such is not usually the 
case. However, one computer connected to the Internet can 
communicate with any other computer connected to the 
Internet; the message will most likely travel over a path 

30 comprising a sequence of links, or "hops," between computers 
that are directly connected to each other. 

A first user terminal 12 is also depicted in FIG. 1. 
The first user terminal 12 is connected to an Internet 
service provider (ISP) 14, which is typically just a 

35 computer, router, or terminal server connected to the 

Internet 10. An ISP 14 can host additional user terminals, 
such as a second user terminal 16. Other ISPs, such as a 
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second ISP 18, are also connected to the Internet 10. A 
third user terminal 20 is shown connected to the second ISP 
18. Only three user terminals are shown; however, it should 
be recognized that the number of concurrent users of the 

5 invention is unlimited, subject to the operational details 
set forth below. 

As is known in the art , content providers are also 
connected to the Internet 10. A first content provider 22 
might provide a certain kind of content, for example sports 

10 scores and highlights. 1 A second content provider 24 might 
provide a different kind of content, for example business 
news . 

Traditionally, if a user (such as the one using the 
first user terminal 12) wished to access the content 
15 provided by the first content provider 22, the terminal 12 
would query the first content provider 22 directly. A 
request message would propagate from the terminal 12, across 
the Internet 10, to the content provider 22. The content 
provider 22 would send the desired data across the Internet 
20 10 back to the terminal 12. 

Several delivery, or "mirror" , sites are shown connected 
to the internet 10 in FIG. 1. A first delivery site 26 
might be located a small number of "hops" from the first 
user terminal 12. A second delivery site 28 might be 
25 located further away from the first user terminal 12, but 

close to the third user terminal 20. A third delivery site 
30 might be as close to the third user terminal 20 as the 
second delivery site 28 is. As previously noted, a user and 
a provider or delivery site that are "geographically" near 
30 each other might not be "electronically" near each other on 
the Internet. By decreasing the "electronic" distance 
between the user and the provider or delivery site, the 
number of network connections and routers over which data 
must travel can be decreased. 
35 as discussed above, the Smart Mirroring system acts to 

improve network performance by decreasing the incidence of 
the foregoing network problems. Packet loss" and delay 
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problems are generally decreased by reducing the number of 
network connections over which data must travel, although in 
some cases, the network testing procedure of the invention 
shows that some longer paths provide better throughput than 

5 " some shorter paths. Very little packet loss, and 

essentially no delay, occurs in network cable; it typically 
is caused by overloaded network storage and routing devices. 
Because the Smart Mirror sites of the invention are located 
electronically near each user, packet losses and delays are 

10 reduced. The problem of excessive server utilization is 

reduced because multiple delivery sites share. the load that 
typically would have been handled by a single ser,ver. The 
relatively low capacity of the network infrastructure 
becomes less of a problem, because data retrieved from 

15 parallel delivery sites in different locations generally 
need not travel over the same network links. 

For the purposes of describing this invention, a 
delivery site is a "node" on the network which may store 
data or other files, such as software code, for delivery. 

20 The term can also include a site which is responsible for 
data delivery, including mirror sites, content providers, 
and servers for broadcast video streams or Web sites. 

In the system, a mirror service provider X MSP ) 32 is 
connected to the Internet 10. The MSP 32, which exercises a 

25 management function over the distribution of delivery sites 
26, 28, and 30, and over the allocation of requests to the 
original and delivery sites from user terminals 12, 16, and 
20, includes a database capable of transmitting and 
receiving data over the Internet 10. 

30 This management function is facilitated by the use of a 

conf iguration utility 34 and a client program 3 6 run within 
a storage medium (i.e. random access memory) on the user 
terminal 12. Although the conf igurat ion utility 34 and the 
client program 3 6 are shown in FIG. 1 as a part of only the 

35 first user terminal 12, it should be recognized that any 

user terminal, such as terminals 16 and 20, participating in 
the system will use such software. A user desiring to 
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participate in the system can obtain the software comprising 
the configuration utility 34 and client program 36 directly 
from the MSP 32, or through traditional retail or other 
channels (such as being part of the browser or operating 
5 system of the computer) . It should be noted that the 

functions performed by the configuration utility 34 in the 
described embodiment of the invention can be integrated into 
general Internet application software, such as a browser or 
other network application; a stand-alone program is not 

10 necessary. 

In a preferred embodiment, the configuration utility 34 
must be run by the user, either by command or automatically, 
before the user terminal 12 will have access to the system. 
The operation of the configuration utility 34 is shown in" 

15 detail in FIGURE 2. 

The configuration utility 34, when first run on the 
user terminal 12, retrieves a delivery site file (step 40) 
from the MSP 32 (FIGURE 1) . If the user already has a 
delivery site file (e.g., it was received with the 
20 configuration utility 34), and that delivery site file is 
sufficiently new, the delivery site file can be retrieved 
from the local hard disk of the user terminal 12 . This 
delivery site file contains a list of all available delivery 
sites (such as delivery sites 26, 28, and 30) and a list of 
25 network tests to be run at the user terminal 12. In the 
context of the invention , there can be as few as two 
delivery sites, or if the number of users justifies it, as 
many as several thousand. The number of sites in principal 
is unlimited, with each available delivery site represented 
30 in the delivery site file. 

The delivery site file is generated by the database 
from within the MSP's computer system. The database 
application uses information about the user to dynamically 
determine the optimum tests to run. Consequently, the 
35 delivery, site file need not contain entries for every 

delivery site in existence; the list can be tailored to 
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include only -those sites which appear appropriate or 
feasible. 

Initially, the magnitude of run-time variation in test 
configurations for the delivery system users is low; that 
5 is, the first group of users all run essentially the same 
tests; As the delivery service grows, however, the 
intensity of each user's testing is reduced in order to 
compensate for the increased magnitude of testing network- 
wide. The scope of testing and the number of delivery sites 
10 tested both can be narrowed to further reduce the aggregate 
load of network testing. 

In one embodiment, the delivery site file will have a 
format generally as follows: 

1. File Revision Number and Message. The file 
15 includes this field to determine whether a new version 

of the configuration utility 34 is available. If the 
revision number in the delivery site file is higher 
than the version number for the configuration utility, 
configuration is not allowed. Instead, the user would 
20 be prompted to acquire a newer version of the 

configuration utility 34. File revision verification 
as described herein ensures that the most up-to-date 
delivery site selection algorithms are applied to the 
test data generated by the configuration utility 34. 
25 2. A list of available Smart Mirror delivery 

sites. For each available delivery site, the following 
information is provided: 

a. Host name. In the known Internet format 
of "www.server.com." 
30 b. IP Address. A numerical Internet 

address in the known format. The address is 
presently a 32-bit number of the f orm w.x.y . z . , 
where w, x, y, and z are each in the range of 0 to 
255. 

35 c. Alternate Name. An informal name such 

as "The First Mirror Site." 
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d a list of tests to be executed. For 

each test, the following *J^^> 
i Test ID. Each type of test has a 
unique" identifier known to the configuration 

utility 34. " 

ii. weighting factor. Each test will 
be weighted by a specified percentage. 

iii Frequency. Each test is not 
necessarily ru* every time. This field 
specifies a probability, determining how 
often a particular test will be run 

iv Additional Information (optional) . 
^ For certain tests, additional information may 

be needed. 

e site Preference Level. Each site can be 
given a weighting, or preference level between, 
for example. 1 and 100. As discussed below 
aggregate data in the HSP-s database is used to^ 
perform networ* usage analysis not possible with 

Lly the single user • s instantaneous end to end 
testing The weighting factor provided here is 
: e si: ro incorporate the test results received from 
the service provider's database. Th!s weighting 
factor is also used to li-.it assignment of new 
users to a delivery site once a predetermined 
maximum usage level has been reached. 

f Test Site Flag. If this flag is 
enabled, the foregoing tests will be run, but the 
site will not be assigned as a delivery 
If it yields the best performance. 
30 g content Provider Groups. Each site can 

belong' to one or more content provider groups , 
thereby mirroring only certain content If a user 
is not interested in the types of data hosted by a 
Particular delivery site, then it does not need to 
be tested. 
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The configuration utility 34 then queries the user 
(step 42) for various items of information needed in the 
configuration process, for example, the user's name, e-mail 
address, password, modem speed, and information related to 

5 access control (e.g. what levels of various attributes are 
viewable by the user) . The access control mechanism will be 
discussed in further detail below. In one embodiment of the 
invention, the information received from the user is 
encrypted and stored in a configuration file on the user 

10 terminal 12. 

The configuration utility 34 then determines whether 
the user terminal 12 is connected to the Internet (step 42) . 
If not, it will initiate a connection (step 44) or prompt 
the user to do so. 

15 A series of network tests is then performed (step 46). 

one or more tests can be performed for each available site 
listed in the delivery site file; not all sites in the file 
need to be tested. 

The following test types are presently considered to 

20 provide useful data: 

1. Ping. Provides information on whether a 
remote server is reachable, and if so, how long it 
takes for a low-priority message to travel round trip 
from the user terminal 12 to the remote server and 

25 back. Ping is a simple test useful in deciding whether 

a site is available for further evaluation. Excessive 
times returned by the ping application can be used to 
eliminate delivery systems which are far too "slow" for 
effective information delivery. This test is used by 

30 the terminal to reduce the number of delivery sites 

tested. 

2. Tracer oute. Provides information on what 
route is taken by a message from the user terminal 12 
to a remote server, including what systems are used 

35 along the way, and how long each hop takes. Traceroute 

is used by the configuration program 34 to document the 
path of information transmission. Several traces with 
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differing results might indicate that the stability of 
^e rout! fro* a particular user to a specific server 
is not acceptable. Previously aggregated data on 
Particular routes, fro. the service provider's system 
database , may also influence the decision to choosea 
particular delivery site for a specific user. Route 
stability is the primary consideration. 

3 Reverse Traceroute. Provides information on 
what route is ta*en by a message from a re~te server 
to the user terminal, including what systems are used 
alo^ the way, and how Ion, each hop ta.es. Reverse 
Traceroute is used by the conf iguration program to 
document the path of information receipt. ™> 
«aces with differing results might indicate that the 
rtability of the route from a particular server to a 
specifier is not acceptable. 

data on particular routes, from the serv ~ ovider 
system database, may also influence m 
loose a particular delivery site for a specxfi user, 
choose p _ primary consideration. 

20 Again, route stability P similar to traC eroute or 

4. Dynamic Traceroute. Similar ro ^r . 

traperoute but between any specified pair of 
reverse traceroute, , ne . e d 
computers on the Internet. Dynamic Traceroute is used 
by Le configuration program to document a path of 

i^drsn several traces with 
information ™^ n ^ ^ stability of 
differing results might inaicauc 
■ . ■ ■ the route between two networK locations is «* 

acceptable. Previously aggregated data on P«txo-« 
routes from the service provider -s system database 

, „ influence the decision to choose a particular 
3„ My alS ° ln " U ^ a 3 PeciIic user. As above, route 

delivery site for a specu in- 
stability is the primary consideration. 

5 Name Server Resolution Delay. If the numeric 
internet address is unspecified, a name server loofcup 
is performed to determine what numeric address 

corresponds to the desired host name. This process can 
take a substantial amount of time. 
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6. Throughput. A sample file is downloaded, or 
partially downloaded, from the remote server to 
determine the actual throughput in bytes per second. 

7. Throughput variation. A sample file is 

5 downloaded, or partially downloaded, from the remote 

server to determine if the throughput is relatively 
constant or fluctuating. 

8. Error rate. A sample file is downloaded, or 
partially downloaded, from the remote server to 

10 determine if the transmission is subject to 

transmission errors. This information is obtained by 
counting the number of error message packets returned 
to the user, thereby learning of the error rate from 
the user to the server, and by tracking the number of 

15 bytes received as a fraction of the number of bytes 

transferred, thereby learning of the error rate from 
the server to the user. 

9. Packet Fragmentation. A sample file is 
downloaded, or partially downloaded, from the remote 

20 server to determine it the transmission is subject to 

fragmentation or out-of-order packet reception. 

10. Capacity Query. If the remote server is so 
enabled, the configuration utility 34 queries the 
server to determine its transmission capacity and its 

25 average load. This information is collected via the 

Simple Network Management Protocol ( W SNMP" ) , which is 
supported by nearly all Internet servers. 

11. Macroscopic Network Analysis. The data 
accumulated by the MSP database offers a global view of 

30 network behavior. This information permits the Smart 

Mirror system user to have a historical view of the 
performance of the available delivery sites. The 
accumulated data is manipulated by the delivery system 
database to analyze network performance, in order to 

35 emphasize usage in high-capacity areas of the network, 

while deemphasizing usage in areas already experiencing 
diminished performance. 
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• nhnv each of the foregoing tests are 
Informatxon on how each ana i ys is. 

performed is ^^T^*.. Hin****-***^^ 

See, e.g., Bob Quxnn and ° ave _ q embo diment of the 

invention, testing rea chable, a series of 

to verify whether a server xs , noK \ 
test to verxiy ser ies of large (e.g. 200K) 

. ' to information in res 

correlate test da ta *° and serve rs can he 

ua y, particularly bad network ^ 

identified. Such ^Lticular linK or server is 

si te file disc* ssed *™«-JJ J^^tion obtained fro* 

S -own to be -«^; u :ruser can be routed away fro. it, 
other users, an xnaxvx«"« - 
even if a single test f^^^^ -rver 

- Sh ° rt ^rrerverreriution delay can be 
capacity. The n3 >! test as well as the time it takes a 
20 determined by such » tes *' result is strongly 

server to begin sending data *» ^ 
related to server load, capacity and , „ 

T he long downloads allow the con ^ ^ ^ 

to determine how^acket ^ . - ^ ^ ^ ^ „ 
25 utilisation affect file a caus ing 

aetermine which of the * °regoin9 ^ from a single 
.creased performance ^ - - ~t ~ ^ „ ^ irt 

r^T^S- * - - „. indicate the root 

" ~~ should be noted that 

u sed in conation with o«,er ^^f^ . capaclty 
the load on a delivery site as det t est to 

— can be "^^T^ tiB e. given the load 
35 derive an average expecxea 

characteristics of the server. 
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After all specified tests are run, the results are 
collected and processed (step 48) • It is possible that 
certain tests were not able to be successfully performed; in 
such cases, the results should indicate an appropriate 
5 worst-case value (e.g. zero throughput or extremely high 
delay) . 

It is important to note that not all possible tests 
will be performed each time the configuration utility 34 is 
run. When a large number of users is using the system, a 

10 substantial drain on server and network capacity would be 
caused by the testing procedure alone, increasing the 
downward spiral of network performance previously discussed. 

As noted above, a test frequency number is stored in 
the delivery site file for the purpose of dynamically 

15 controlling the number of users performing a test. The 
testing performed by the configuration utility 34 is 
performed to achieve statistical confidence in deciding 
which delivery site is best suited for data delivery to a 
particular user. Statistical confidence is obtained by 

20 either testing a small sample of users sufficiently well and 
using that data to influence the choice of a delivery site, 
or by having a large number of users each "lightly" test 
several available sites and using that data in the 
aggregate . 

25 Accordingly, when the system is used initially, a 

relatively small number of users are "enrolled" in the 
system. The delivery site file maintained by the MSP 32 
reflects those conditions, and requires each user to test 
the network (through the configuration utility 34) 

30 relatively heavily. As the number of users increases, the 
delivery site file is modified to decrease the tests 
performed by each user. By the time a very large number of 
users are using the system, the configuration utility 34 may 
predominantly test for delivery site reachability (via a 

35 "ping" -type test, as discussed above) , and rely primarily on 
test data provided by other users and stored in the database 
maintained by the MSP 32. However, even when many users are 
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» small number of users (e.g. one in 
testing the system. . small set 

5,000, may be s* 1 ^* 0 ™ uld not contribute more than 
Preferably, testxng^shoul^ ^ 0ne way to reach 

proximately 5% « . o ^ MiBber Qf servers _ 

this goal is to lightly ^ aaeguate 

yielding a delivery sites can then be used 

performance. Thrs gro p In£orma tion on multimedia 

in rotation to eao „ o£ the delivery sites in 

cli p actual download times * ^ 

, the group is 'Y^^ormance can then be 

information on delivery site p _ t ^ ne ed for 

furnished to the HSP 22 transparently. 

further outright testing. ^ ^ c<>llected test 

xccordingly. on t*e * ^ ^ site 

15 results, and on ln£ ° raatl °" f * ratlo „ utility 34 determines 
me by the MSP 22 the =™»- livery sltes . is best for 
whichdeliverysite. or g f dete rmination can be 

the user terminal 12 " various tests performed 

made numerically by weighting the^ 

20 and comparing ^ embodi ment. for use in a low- 

In a presently preferr number o£ 

^f fic ;f ^^rclf i/uration utility 3< 

delivery sites and users, t . £or eacn 

relies primarily on ping and ^ „ per£orme d 

B available ^^"^ site is reachable, short and 

to aete ^ ne d 1 B £ ^ e a :^ £o ^ed in the throughput tests to 
long downloads are perfo ^^puts from the 

aetermine the maximum and ^ ^ „ 

delivery *° oommodate the transmission of video 

3D is small enough to accomm ^ h . gh wei9nts . 

d ata. ^^^i:"^!, can be performed, and the 

Other tests, such as playing a role in 

results reported to the n . be given 

the Choice of -^y^ £ or Sample, . *s the size 

35 weights of zero, or ltional delivery sites and 

of the system ^ £or mula can be 

users are enrolled, the sn- 
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altered, by changing the delivery site file contents, to 
reflect changing network conditions. 

In one embodiment of the invention, a proprietary 
graphical interface is provided so that the location of the 

5 user and the locations (both geographic and electronic) of 
each site tested can be displayed on a monitor connected to 
the user terminal 12, allowing a visual indication of the 
relative distances between sites. In one embodiment, the 
display is shown in the form of a "radar screen," upon .which 

10 the user terminal 12 and delivery sites are displayed as 
"blips" superimposed over a map of the pertinent 
geographical region. In order to encourage the user to use 
the application and to offer more network-wide data, the 
user interface can allow the user to enter an "ad-hoc" test 

15 site for additional performance testing. In this case, the 
configuration utility will test either the default Web page 
file (e.g., "index.html") or a specific file requested by 
the user. Analysis results from the user-selected site are 
adjusted so that reasonable comparisons can be made with 

20 results from other sites. 

It should be noted that multiple sets of delivery sites 
can be maintained by the invention, to accommodate several 
groups of content providers. Each content provider might be 
mirrored only at certain sites. Accordingly, for each 

25 content provider having a unique set of delivery sites, a 
primary delivery site is selected by the configuration 
utility 34. To accomplish this, the foregoing tests can be 
run once, and then, in one embodiment, a numerical weighting 
can be applied to each appropriate set of delivery sites. A 

30 plurality of Smart Mirror sites is selected, one for each 
content provider group. The content provider groups is 
specified in the delivery site file; each possible delivery 
site is identified as belonging to one or more content 
provider groups. When content provider groups are used, 

35 there can be as few as two groups; the maximum number is 
essentially unlimited. 
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• r-itized ranking or 
It should also noted ^ '/"o If 
deli ,« si- can also -J^£ SMr t Mirror site « 
• j„„e failure of the pr ^ nex t- 

1S ^f'uov the system to fall 

site , and all ^ ^ „„, 1S c=»P ^ . pt 

_ „ msp 32, the data is 
ty rid analog the system. tte identity 

«- — ^t^r^each set of delivery 
* the selected Smart Mirror (encrypted) 
tit" or the prioritized list. « ^ £igaration utility »ay 
conflation file (*- ^ • ^ ^^ce for each 
also save information pr69rM " uses ^e 

tested delivery site. „ dovm ioad data til" I 

clips or other ^ ^ 

Site ' ia be noted that in the °P er ^°" 32 main tains 

Xt should be . . n functions. The MSP 

information on the per Varlt) us data P 

the internet and such information 

techniques « ^^U^ sites ^ ^ 
The locations of th by a number of fa 

v„« are ultimately determm , _ ost/ benefit 
invention -~ ^derations and cost/ 

including marKetmy 
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analyses. However , the data stored in the MSP's database 
can confirm the utility of placing a delivery site at a 
given location on the Internet or other network. In one 
embodiment, servers are located on each major backbone 
5 (portion of the Internet maintained by a single corporation) 
and on other Internet lines serving large numbers of users, 
such as the major lines operated by the Regional Bell 
Operating Companies ("RBOCs") . In certain networks serving 
large numbers of users or having heavy video delivery 

10 traffic, servers can be placed at major Points of Presence 
("POPs") for the network to ensure that each user has 
excellent to a fast server. 

Once the configuration utility 3 4 has been run, the 
user can use the system to enable and facilitate the receipt 

15 of data files, specifically video clips, audio clips, 
software programs, and other content. 

As time passes and the usage patterns of a user's 
region of the Internet change , the user might become 
dissatisfied with the performance of the Smart Mirror site 

20 associated with his user terminal 12. If that happens, the 
user is free to re-run the configuration utility 34. By 
that time, additional delivery sites might have been placed 
into service, or a different pre-existing site might perform 
better than the one previously assigned. Furthermore, if 

25 the player program 3 6 determines that the selected Smart 
Mirror site is not performing adequately (e.g., it has 
failed three times out of ten attempts) , the player program 
36 can prompt the user to re-run the configuration utility 
34. In other embodiment of the invention the testing and 

30 mirror assignment is run automatically with each request for 
a file on the Smart Mirror service or at some intermittent 
times such as after every other, every third, every tenth or 
every one hundredth request. 

In one embodiment of the invention, the Smart Mirror 

35 system is used to locate a delivery site from which to 

download a video or audio clip ("clip") referenced on a Web 
page. In this embodiment, the client program can be 
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referred to or cons ^ * '/^t the functions of the 
program, i" addition to an a playback of 

P 0 Uent program 36. enable, ^ ^ „ on the 

viae odata. ordinarily, a* r Brows er programs 

US er terminal 12 to view «eb con navigator, and 

locally used include -^J^ P prog ram 3, allows 

Microsoft internet ^"^J^ « sites on the Interne^ 
the user to hotltnK »»° n * /*" HTHIj docu ments to indicate 
Th e E««D tag ^ »-^JU- -V tne system. — 
0 which Web pages include content con t a inin, an 

the browser program 38 referenced by the ta, is 

EMBED tag . ' a download of the fx Jf the fl le is of 

fenced, and the file type MPEG, the 

. type handled by the P££ £ g ^ The 

B browser progr^ ; ^ed by the browser program 

contents of «xe ^ y 
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The Player progr am 36 ^ ^ The operat.cn 

Mirroring services ln aet ail in FIGURE 3. 

20 cf the player program 36 « . ^ ^ed tag to 

The player program first analy (step 
aetermine if there is an lnalcates that the 

60) ; the presence o£ such p t D ata 

embedded clip i= enabled for Smart * ^ partl cular 

25 associated with the "S»« P*™f cUp originated, as 

content provider f rom^ch ^ COTt ent 

well as the group of mirror s 

provider uses. 36 determines that the EMBED tag 

K the player program " „ t hanaled by the 

,0 references a video clip or <^"°^ clip from the 

30 system (step 62, , the transf er of t J6 ^ 

content provider 22 is Ration from the WD 

extracts access control ^ rat ing information 

sta tement (step 64) . if «* •» levels stored in the 
J5 is compared against the terBinal 12 (step 66, . 

configuration file ^^exist for the clip, the 
I£ rating information does not exi 
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configuration file is queried to determine whether unrated 
clips, as defined below, may be played (step 68) . Based on 
the foregoing information, the player program 3 6 will 
authorize or decline the viewing of the desired clip. 
5 If playback is authorized, the player program 3 6 

attempts to find the referenced clip on the local computer 
belonging to the user terminal 12 (Step 70) . If it exists 
there, it is not re-downloaded, and can be played directly 
on the computer (from the disk or from RAM) (step 72) . 

10 However, the time and date of creation of the clip on the 
local computer is first verified against the time and date 
for the clip available on the network, to determine if the 
stored clip is the most recent version (step 74). If it is 
not, the stored clip is discarded (step, 76) and the download 

15 proceeds as follows. 

If the clip does not exist on the local computer, the 
player creates a new URL (step 78) in the following form: 
"http://" , plus the IP address of the selected Smart Mirror 
site stored in the configuration file, plus the path name to 

20 mirror files (e.g. "/pub/ mirror/") , plus the name of the 

content provider taken from the "SM" parameter in the EMBED 
statement, plus the filename taken from the EMBED statement. 
The constructed URL is used to retrieve the selected clip 
from the appropriate Smart Mirror site selected by the 

25 conf iguration utility 34 (step 80). If more than one set of 
delivery sites exists for different content providers, the 
M SM" parameter is further used by the player program 3 6 to 
determine which Smart Mirror site in the configuration file 
is to be used in the constructed URL (step 82). 

30 If the clip corresponding to the constructed URL is not 

found at the Smart Mirror site, or -is unable to be accessed, 
then the download proceeds from the next-highest ranked 
Smart Mirror site in the configuration file (step 84) . If 
all delivery sites fail, the download proceeds from the 

35 original content provider's site as specified directly by 
the EMBED statement. 
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If playback is disallowed, the player prevents the clip 
1 rf.r^d (step 88) and displays a bitmap (step 
fro- being transferred (step^ ^ ^ ^ 

901 a " pUyeTpro^ ^determines that the ^ ta, 
r eferira P vided clip or other content not -d!ed by the 

! tka olaver will check whether the access control 
ZTsetlJZ Configuration file --s the user t play 

Iti; "srana :lspl!ys a message (step 100, advising the 
— — ~ rT^arfUr^esenting the desired 
5 up r storTd -th": s r oi £ f ied daL area on the local 
t^er ^ally on the hard disk, belonging to the user 
r i, (step 102). in one embodiment, this data area 

terminal 12 (step ) fc basls by ^ player 

can be managed on . , le t recently^ ^ ^ ^ ^ 

20 rnew^iip: - ^ - 

v~ ri bearded to make room (step 104). 
clips can be discarded ^ m cl ient program 

in one embodiment of the invention, the clien P g 
36 is capable of sending messages to the MSP 32 (step 106) 
25 t o reflect whether downloads were successful. This message 
" ^ ^ ,, vacc of? t^e user terminal 12, tne 

contains the ^ Inte rnet address of 

identity of the selected server set^ 

the site used to accomplish the downio _ 

* .n sites which failed, the name of the file 
addresses of all sites wniu» 

30 downloaded, and the time to download ^ ' 

information can also be used by the MSP 32 ^ 
downloads and to determine, in real tame, whether 
any problems with any smart Mirror sites. 

y p „oorai 36 can maintain a 

Alternatively, the client program 

35 ^11 locax database of file ^£^™7Z can 

download would ^^^^T^ £ begin sending 
be gathered on the time it taKes a 
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the requested file, the stability of the data transfer rate, 
and the error rate of the transfer. At some interval (e.g. 
weekly or once every 100 downloads) , a message containing 
the accumulated file transfer performance information, as 
5 well as the user and server information discussed above, 

would be sent (automatically or upon request) to the MSP 32 
(step 106) to update the MSP's database. This additional 
information increases the MSP 1 s "knowledge" of network 
performance without incurring any additional testing 
10 overhead. 

This data is especially valuable in ascertaining the 
performance of delivery sites, for the purpose of assessing 
the quality of service purchased from the delivery site 
provider, and for documenting the quality of service to 
15 content providers, to support the cost of the system. It is 
recognized, however, that much of the same information can 
be obtained through new users running the conf iguration 
utility 34. 

Furthermore, the provision of download information to 
20 the MSP facilitates the use of the invention as a premium 
subscription-based service. As successful downloads are 
tracked in a database , each user can have an associated 
"account" to track charges. The user can be charged for use 
of the Smart Mirror system by the file, by the megabyte, by 
25 the month, or by other known means. In one embodiment, the 
EMBED tag associated with a file contains billing 
information, or a "price" for the file. The invention" s 
tracking of download performance allows discounts or credits 
to be issued if downloads are found to be unduly difficult 
30 or slow. 

To ensure that files stored on Smart Mirror delivery 
sites are used only by authorized users of the invention 
(e.g. those users having paid accounts) , the files stored at 
the delivery sites can optionally be in encrypted form, and 
35 the downloading step described above can include a 

decryption step. Such encryption and decryption can be 
performed by well known means. 
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typically would be invoked at the user terminal 12 to view 
or use other types of data. 

It should also be noted that certain functionality 
described as performed at the user terminal 12 
(specifically, certain functions performed by the 
configuration utility 34, or client/player program 36) can 
be implemented as a standalone program, as a "plug- in" or 
"helper application" to run within a browser program, or as 
a Java applet downloaded from a delivery site to run within 
a browser environment. For user terminals capable of 
running the Microsoft Windows operating system, an 
environment known as Microsoft ActiveX is also useful. 

While certain exemplary structures and operations have 
been described, the invention is not so limited, and its 
scope is to be determined according to the claims set forth 
below. 
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Wbat is claimed is: ^ a delivery sit., seated on 

x . A system for sele 9 Q a rece . ve a 

2 a distributed computer network. 

3 tile, the system comprising: 

a c°-™ iMti ° n * "provider connected to the network, 
5 at least one content P«« 1Mst „ ne fUe; 

« wherein the ^"^^ connected to the network. 

^ le TdeU^y sZ stores the same rile as the 
wherein each delivery 

content provider; network; 

a user terminal <=°""^ £or testlng tne network and 

* netW trere«ed 9 d a eU:e« site from all delivery sites 
selecting a preterre 

connected to the network; an din9 ^ fUe from 

a download^ • ^ user ter mi„al. 
the preferred delivery si 

' . to cla im 1, wherein the network 

a . A system storage medinm connected 
testing 

apparatus come* — medium 

to the user terminal, « her *"\ for testing the network 
^ies a first software progra^f ^ 
and selecting a preferred delivery 

sites connected to the network. 



8 
9 
10 
11 
12 
13 
14 
15 



1 

2 

3 
4 
5 
6 



1 

2 

3 
4 
5 



3. i^-^r^^s,- 

downloadin, apparatus compr^ ^ ^ 

connected to the user program capable of 

M dium embodies a second software PJ ^ ^ ^ 

downloading the file from the prefer 

6 user terminal. 

4 Asys te. according - clai, 3, -rther comprising 

\ a database connected to the network. 

™-dina to claim 4, wherein the first 
5 . A system accord^ to ^ ^ by 

software program uses a list o 



1 
2 

3 the database. 
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1 6. A system according to claim 5, wherein the first 

2 software program' uses results from a network analysis 

3 procedure to choose the preferred delivery site from the 

4 list of delivery sites. 

1 7. A system according to claim 5, wherein the first 

2 software program uses results from a network analysis 

3 procedure to create a prioritized list of preferred delivery 

4 sites. 

1 8. A system according to claim 6, wherein the network 

2 analysis procedure is performed at least in part at the user 

3 terminal. 

1 9. A system according to claim 8, wherein the network 

analysis procedure comprises a plurality of network tests. 

1 10. A system according to claim 9, wherein the network 

2 tests are selected from a list of tests maintained in the 

3 database. 

1 11. A system according to claim 10, wherein the list 

2 of tests includes a download elapsed time test. 

1 12. A system according to claim 11, wherein the 

2 elapsed time test is normalized by an average load 

3 experienced by each tested delivery site. 

1 13 . A system according to claim 9, wherein the network 

2 testing procedure yields a numerical performance rating for 

3 each tested delivery site. 

1 14. A system according to claim 13, wherein the 

2 numerical performance rating is calculated as a weighted sum 

3 of results from the network tests. 



1 

2 
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15 A system according to claim 14, wherein the 
lighted sum is calculated using weights P-.ded «« 
network test by the database. 

16 a system according to claim * , wherein results 
16. A syst pro viaed to the database. 

from the network tests are pr 

17 a system according to claim 16, wherein the 
results are stored in the database. 



2 

1 
2 



1 
2 



„; A system according." ^ ^^^^ 
software program performs the networX 



with the user terminal 



te m according to claim 18, wherein the second 
19 . A system d in conjunc tion with 

software program performs the down 



1 

2 

3 the user terminal. 



20 a system according to claim IS, wherein the first 
2 software program can be executed at any t!me. 

2 software program is executea * 

3 is downloaded. 

22 . A system according to claim 1, wherein the file 
2 comprises a video clip. 

-1,-s™ n wherein the second 

23 . A system according to claim 7 wher 
2 software program uses the prioritized list. 

24 . a system according to cla^ : 
software program first attempts to use 
specified in the prioritized list. 



1 
2 
3 



1 
2 



first delivery site fails, the s 
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3 attempt to use succeeding delivery sites specified in the 

4 prioritized list. 

1 26. A system according to claim 25, wherein if all 

2 delivery sites in the prioritized list fail, the second 

3 software program will attempt to download directly from the 

4 content provider. 

1 27. A system according to claim 12, wherein the 

2 average load is monitored by each delivery site, and a 

3 number corresponding to the average load is stored by each 

4 delivery site to be accessible to the network. 

1 28. A system according to claim 1, wherein the file 

2 has characteristic information associated therewith. 

1 29. A system according to claim 28, wherein the 

2 characteristic information includes information specifying 

3 that the file is stored on at least two delivery sites. 

1 30. A system according to claim 29, wherein the 

2 characteristic information identifies the content provider. 

1 31. A system according to claim 28, wherein the 

2 characteristic information includes content rating 

3 information . 

1 32. A system according to claim 28, wherein the 

2 characteristic information is stored within an "EMBED" 

3 statement corresponding to the file. 

1 33. A method for determining a set of preferred 

2 delivery sites from a plurality of delivery sites on a 

3 network, comprising the steps of: 

4 obtaining a list of delivery sites; 

5 performing at least one network test; 

6 processing results from the network test; and 
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prio riti,in 9 the delivers accordin, to the - 

results . 

„ , olaim 33, wherein the performing 
«• — - th ° d : £ .!" ! f the delivery sites in the 



list. 



34. The method of <* m '^ delivery sit es in the 
steP comprises testing some of the 

■ 34 wherein the performing 

35. The method of claxm 34, 

step comprises: livery sites from the list; and 

selecting a suhset of of the de livery 

4 running at least one test for 

5 sites in the subset. 

x 36 . ^^.-«^--^~ t - tl * 

selected fro» a plurality of tests. 

, ,6 vherein the plurality of 
» • — .!t:r in ° f a Xhase' connected to the network. 
2 tests is specxfxed in a 

-,,^37 vherein the test is 
38. The method of claxm , ^ based 

2 selected from the plurality hy Y 

3 on a frequency specxf xed xn the 

* 1a 4™ 33 further comprising the 
39 . ™°***Xs^s l° a' database connected to the 

2 step of sending the result 

3 network. 

, 33 wherein the processing 

40 . The method of claxm 33, wh 

^4 «* B weighting the results . 
2 step comprxses wexg» 

„ , claim 40, wherein the weighting step 

41 . The method of claxm 4 , ^ 
2 uses weights specif xed xn the 

* i»t» 33 wherein the prioritizing 
42 . The method of clax 33* ^ ^ & 

^ r ; ges storing the seT; 

2 step comprises & ^ 

3 preferred order. 
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1 43. The method of claim 42, wherein the preferred 

2 order is determined by the test results. 

2 44. The method of claim 33, wherein the list of 

2 delivery sites is obtained from a database. 

x 45. A method for downloading a file from a delivery 

2 site, comprising the steps of: 

3 identifying a desired file; 

4 determining whether the desired file is mirrored on one 

5 or more delivery sites; 

6 if the desired file is mirrored, identifying a 

7 preferred delivery site from a prioritized list of 

8 delivery sites; and 

9 downloading the desired file from the preferred 
10 delivery site. 

1 46. The method of claim 45, wherein the retrieving and 

2 downloading steps are repeated until the downloading step is 

3 successful . 

1 47. The method of claim 46, further comprising the 

2 step of downloading the desired file from a content provider 

3 site if the repeated retrieving and downloading steps are 

4 unsuccessful. 

! 48. The method of claim 45 , further comprising the 

2 step of verifying file authorization after the determining 

3 step. 

1 49. The method of claim 48, wherein the verifying step 

2 comprises the substeps of: 

3 checking if the desired file has a content rating; 

4 if the file has a content rating, comparing the content 

5 rating against a standard specified by a user; 

6 if the content rating is acceptable to the user, or if 

7 the clip has no content rating and unrated clips are 
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arizing the downloading 
accept to the — author^ 

step; and downloading step, 

otherwise declmmg the 

, •„ 49 wherein the downloading 
x - 50. ^^^nru^orizea. 
2 step is Perked only xf * 

. 45 further comprising the 

step of storing the a 

user terminal. .„ rtBte p 

1 . 51 wherein the storxng step 

52 Th e method of claxm 51. 
comprises the suf f ic ient room in the data 

determining tnex *iie; 

U there .» » ^ aata area; and 

recently-ised f data area, 

savin, the downloaded 

S " PS to «- do—, -p. ***** ~ ^ 

Ts"ed £il e 1» -e data -- ^ ^ 

verifying that 

desired lile; and are lden tical. 

u the desired tiW and ^ d o£ downl oadina the 
displaying the stored *.M 
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2 
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5 
6 
7 
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desired fil e - 



1 
2 
3 



desire - — 

■ „ wherein the verify"*, step 
M . ^ method - ^ crMtion £or tbe 

r;.r;i- r r.:.- t u.. 

• A5 further comprising the 
»• *• "^^"ertlce provider of a succeed 
step of informing a mirror 

5 download. 
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1 56. A method for locating and maintaining a network of 

2 delivery sites for the optimized storage and retrieval of 

3 files, comprising the steps of: 

4 receiving network test results from a plurality of 

5 users; 

6 storing the test results in a database; 

7 processing the test results; 

8 determining network performance at various locations on 

9 the network; and 

10 notifying the users whenever a delivery site is added 

11 or removed. 
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